Noor et al. BMC Systems Biology 2012, 6:140
http://www.biomedcentral.com/1752-0509/6/140



BMC Systems Biology



RESEARCH ARTICLE 



A proof for loop-law constraints in 
stoichiometric metabolic networks 



Open Access 



Elad Noor 1, Nathan E Lewis 2,3 and Ron Milo 1



Abstract 

Background: Constraint-based modeling is increasingly employed for metabolic network analysis. Its underlying 
assumption is that natural metabolic phenotypes can be predicted by adding physicochemical constraints to remove 
unrealistic metabolic flux solutions. The loopless-COBRA approach provides an additional constraint that eliminates 
thermodynamically infeasible internal cycles (or loops) from the space of solutions. This allows the prediction of flux 
solutions that are more consistent with experimental data. However, it is not clear if this approach over-constrains the 
models by removing non-loop solutions as well. 

Results: Here we apply Gordan's theorem from linear algebra to prove for the first time that the constraints added in 
loopless-COBRA do not over-constrain the problem beyond the elimination of the loops themselves. 

Conclusions: The loopless-COBRA constraints can be reliably applied. Furthermore, this proof may be adapted to 
evaluate the theoretical soundness for other methods in constraint-based modeling. 

Background

Constraint-based modeling has become a successful 
framework for the analysis of large and complex stoichio- 
metric biochemical networks [1]. The underlying concept 
of this framework is that one can use the stoichiometry 
of each reaction in a reconstructed metabolic network 
[2,3] and known bounds on reaction fluxes to compute 
metabolic flux for each reaction. These predictions repre- 
sent allowable steady-state metabolic flux distributions in 
a cell under a given growth condition. Some example con- 
straints include mass balance and metabolite uptake rates. 
One set of constraints that has been more challenging
to implement is that associated with thermodynamic
limitations. Without thermodynamic constraints, nonphysical
fluxes can be computed for some metabolic reactions
if they form an internal cycle (Figure 1a). Such
cycles of reactions violate a "loop law" that is analogous
to Kirchhoff's second law for electrical circuits, as discussed
previously by Beard et al. [4]. Many approaches
have successfully constrained these loops using known 
flux directionality [5], energy-balance equations [4], and 
known [6-11] or predicted [12] thermodynamic parame- 
ters. Loops have also been indirectly removed by minimiz- 
ing network flux [6,13-15], or by coupling flux to enzyme 
synthesis costs [16]. 

A new approach, called loopless-COBRA, was recently 
presented [17]. Unlike previous loop-removal algorithms, 
this method does not necessitate extra inputs or data, such 
as metabolite concentrations or thermodynamic param- 
eters. Basically, this method imposes the second law of 
thermodynamics by using a mixed-integer linear pro- 
gramming (MILP) approach to constrain flux solutions so 
that they obey the loop law. Thus, flux solutions from this 
method are all within the portion of the flux space that is 
devoid of loops (Figure 1b). Excitingly, the loop removal
improved the consistency of the simulation results [17] 
with experimental data [18]. Specifically, it provides more 
realistic flux values for reactions that normally contribute 
to loops in a model. Otherwise, flux predictions for such 
reactions would usually have to be ignored in any subse- 
quent analysis. 

Figure 1 Loop law constraints on metabolic networks. (a)
Metabolic network reconstructions frequently have sets of reactions
that cycle all metabolites internally. The fluxes of these reactions are
therefore unconstrained. (b) Metabolic network solutions are found
within a convex space, which is enclosed by known constraints on
metabolite inputs, outputs, and known fluxes. Loops result in
unconstrained dimensions in the solution space (blue). By
implementing loopless-COBRA constraints, all loop-containing
solutions are removed, leaving only solutions that do not contain
loops (orange).



The loopless-COBRA method was shown to work 
in various scenarios, and the paper that presented the 
approach provides an explanation for why this method 
works. However, there is no mathematical proof for its 
formulation as an optimization problem. Specifically, the
original paper does not demonstrate that the additional MILP
constraints do not over-constrain the problem and eliminate
some loop-free solutions. Since constraint-
based methods attempt to only eliminate impossible in 
silico phenotypes (i.e., steady-state flux distributions that 
the cell cannot maintain), it is important to verify that 
solutions representing real phenotypes are not removed 
by accidentally over-constraining the problem. 

Here, we address this issue by presenting a mathemat- 
ical proof for the completeness and soundness of the 
loopless-COBRA method, thereby adding fundamental 
support and rigorous proof for the constraints presented 
by Schellenberger et al. [17]. 



Results and discussion 

Formal definition of loop-law constraints 

In loopless-COBRA, the constraints added to the linear 
problem are: 

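$$
\begin{aligned}
G_i &< 0 \quad \text{for all } v_i > 0 \\
G_i &> 0 \quad \text{for all } v_i < 0 \\
G_i &\in \mathbb{R} \quad \text{for all } v_i = 0 \\
N_{int} G &= 0
\end{aligned}
$$

where $v_i$ are the flux variables and $N_{int}$ is the null-space
matrix of $S_{int}$ (the stoichiometric matrix of internal reactions).
The third constraint ($G_i \in \mathbb{R}$) is not actually a
constraint, but a way to say that $G_i$ can have any value if
$v_i = 0$. We can rewrite all these constraints succinctly as:

$$\exists G \in \mathbb{R}^n \ \text{s.t.}\ \mathrm{null}(S_{int}) \cdot G = 0 \wedge \left(\forall i\ \ \mathrm{sign}(G_i) = -\mathrm{sign}(v_i) \vee v_i = 0\right) \qquad (1)$$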
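
As a minimal sketch of what Equation 1 asks for, the following Python snippet checks whether a given candidate vector G satisfies both parts of the constraint for a given flux vector v. The function names, the tolerance and the three-reaction cycle used as input are illustrative assumptions and do not come from the loopless-COBRA implementation; the snippet only verifies a supplied G and does not search for one.

```python
def sign(value, tol=1e-9):
    """Return -1, 0 or +1, treating |value| <= tol as zero."""
    if value > tol:
        return 1
    if value < -tol:
        return -1
    return 0


def satisfies_loop_law(v, G, null_basis, tol=1e-9):
    """Check both parts of Equation 1 for a flux v and a candidate G.

    null_basis is a list of vectors spanning null(S_int).
    """
    # Part 1: G must be orthogonal to every null-space basis vector.
    for basis_vector in null_basis:
        dot = sum(b * g for b, g in zip(basis_vector, G))
        if abs(dot) > tol:
            return False
    # Part 2: every active reaction needs sign(G_i) = -sign(v_i);
    # reactions with v_i = 0 place no restriction on G_i.
    for v_i, g_i in zip(v, G):
        if sign(v_i, tol) != 0 and sign(g_i, tol) != -sign(v_i, tol):
            return False
    return True


# Hypothetical cycle A -> B -> C -> A: null(S_int) is spanned by (1, 1, 1).
NULL_BASIS = [[1, 1, 1]]

# Third reaction inactive: G = (-1, -1, 2) is orthogonal and has the right signs.
print(satisfies_loop_law([1, 1, 0], [-1, -1, 2], NULL_BASIS))   # True
# All three reactions active: the signs fit, but G is not orthogonal to (1, 1, 1).
print(satisfies_loop_law([1, 1, 1], [-1, -1, -1], NULL_BASIS))  # False
```

In the second call no choice of G can succeed, since three negative numbers cannot sum to zero; this is the situation the loop law is meant to exclude.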

Formal definition of loops 

In order to prove that this constraint eliminates loops (and
only loops), we must first find a mathematical formulation
for a loop, using the same notation as above. We thus
define a loop as a nonzero vector $x \in \mathbb{R}^n$ which satisfies
the mass-balance equation for the internal reactions, i.e.
$S_{int} \cdot x = 0$. This means that although there is a nonzero
net flux in some of the reactions, overall, the internal network
is at steady-state (an obvious violation of the second
law of thermodynamics). It is important to point out that
this equation for defining loops must not be confused with
the steady-state assumption commonly used in flux balance
analysis models, namely $S \cdot v = 0$, where the full
stoichiometric matrix ($S$) is used.

According to this definition, a flux distribution ($v \in \mathbb{R}^n$)
will contain a loop if and only if there exists a vector
$x \in \mathbb{R}^n \setminus \{0\}$ which is consistent with the flux directions in $v$
(i.e. $x_i$ is either zero or has the same sign as $v_i$) and is itself
a loop (i.e. $S_{int} \cdot x = 0$). Formally, $v$ has a loop if and only if:

$$\exists x \in \mathbb{R}^n \setminus \{0\} \ \text{s.t.}\ \left(\forall i\ \ \mathrm{sign}(x_i) \in \{\mathrm{sign}(v_i), 0\}\right) \wedge S_{int} \cdot x = 0 \qquad (2)$$
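
The definition in Equation 2 can be sketched as a direct check in Python: given a flux distribution v and a candidate vector x, the function below tests the three conditions (x is nonzero, x agrees in sign with v wherever x is nonzero, and x is mass balanced on the internal network). The function name, the tolerance and the example matrix are illustrative assumptions; the snippet verifies a given x rather than searching for one.

```python
def sign(value, tol=1e-9):
    """Return -1, 0 or +1, treating |value| <= tol as zero."""
    if value > tol:
        return 1
    if value < -tol:
        return -1
    return 0


def is_loop_in(S_int, v, x, tol=1e-9):
    """Return True if x witnesses a loop in v in the sense of Equation 2."""
    # x must be nonzero.
    if all(sign(x_i, tol) == 0 for x_i in x):
        return False
    # Each x_i is either zero or has the same sign as v_i.
    for x_i, v_i in zip(x, v):
        if sign(x_i, tol) not in (sign(v_i, tol), 0):
            return False
    # x must satisfy internal mass balance: S_int . x = 0.
    for row in S_int:
        if abs(sum(s * x_i for s, x_i in zip(row, x))) > tol:
            return False
    return True


# Hypothetical internal network: A -> B, B -> C, C -> A (rows are metabolites).
S_INT = [
    [-1, 0, 1],   # metabolite A
    [1, -1, 0],   # metabolite B
    [0, 1, -1],   # metabolite C
]

print(is_loop_in(S_INT, [2, 2, 2], [1, 1, 1]))  # True: all three reactions carry flux
print(is_loop_in(S_INT, [1, 1, 0], [1, 1, 1]))  # False: x uses a reaction that is inactive in v
```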

We have now finished laying the groundwork for our 
mathematical proof that loopless-COBRA is sound and 
complete. In order to do that, we are left only to show 
that Equation 1 is satisfiable if and only if Equation 2 is 
unsatisfiable (in other words, there are no loops). 

Gordan's theorem 

We start our proof by quoting Gordan's theorem: For all
$A \in \mathbb{R}^{m \times n}$ exactly one of the following two statements is
true:

(a) $\exists x \in \mathbb{R}^n_+ \setminus \{0\}$ s.t. $Ax = 0$

(b) $\exists y \in \mathbb{R}^m$ s.t. $A^{\top} y > 0$

We will show that statement (a) in Gordan's theorem
is equivalent to having a loop (Equation 2) and statement
(b) is equivalent to the MILP constraints used by
loopless-COBRA (Equation 1). After doing so, we will easily
reach the conclusion of the proof.

As guidance for the following sections, one can see
that statement (a) already resembles Equation 2 if we
define $A = S_{int}$. The only difference is that $x$ is constrained
to have only non-negative values (note the $+$
in $\mathbb{R}^n_+$), instead of being consistent with the sign of $v$.
Corollaries 1 and 2 will show how we can overcome
this discrepancy by defining $A$ in a slightly different
way.

At first glance, statement (b) might look unrelated to
Equation 1. However, in the last part of our proof, we
show that choosing $G \in \mathbb{R}^n$, which satisfies $\mathrm{null}(S_{int}) \cdot G = 0$,
is the same as choosing $y \in \mathbb{R}^m$ and then
taking $G = S_{int}^{\top} \cdot y$. Purely as an aid to understanding
the algebra, one can think of $y$ as the vector of formation
Gibbs energies, and of $G$ as the vector of reaction
Gibbs energies. The rest of the proof, as for statement
(a), deals with adjusting $A$ to fit with the non-positive
values in $v$. The toy example in Figure 2 shows how
Gordan's theorem corresponds to having or not having
a loop, for a network with 3 compounds and 3 internal
reactions.
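
To make the two alternatives of Gordan's theorem concrete, here is a small Python sketch on two hypothetical matrices: a three-reaction cycle, for which statement (a) holds, and the same network with the third reaction removed, for which statement (b) holds. The search for y is a brute-force scan over a small integer grid, chosen here only for illustration; it is not a proof and not how an LP or MILP solver would proceed. All names and the grid values are assumptions.

```python
from itertools import product


def times(A, x):
    """Compute A x for a matrix given as a list of rows."""
    return [sum(a * x_j for a, x_j in zip(row, x)) for row in A]


def transpose_times(A, y):
    """Compute A^T y for a matrix given as a list of rows."""
    n_cols = len(A[0])
    return [sum(A[i][j] * y[i] for i in range(len(A))) for j in range(n_cols)]


def find_positive_y(A, values=(-2, -1, 0, 1, 2)):
    """Scan a small integer grid for y with every component of A^T y positive."""
    for y in product(values, repeat=len(A)):
        if all(component > 0 for component in transpose_times(A, y)):
            return list(y)
    return None


# Hypothetical cycle A -> B -> C -> A (rows: metabolites, columns: reactions).
CYCLE = [[-1, 0, 1], [1, -1, 0], [0, 1, -1]]
# Same network without the third reaction: A -> B -> C.
CHAIN = [[-1, 0], [1, -1], [0, 1]]

# Statement (a) holds for the cycle: x = (1, 1, 1) is non-negative, nonzero, and A x = 0.
print(times(CYCLE, [1, 1, 1]))   # [0, 0, 0]
# Statement (b) then fails: the components of A^T y always sum to zero here.
print(find_positive_y(CYCLE))    # None
# For the chain, statement (b) holds: some y on the grid gives A^T y > 0.
y = find_positive_y(CHAIN)
print(y, transpose_times(CHAIN, y))
```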



Corollary 1 

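For all $A \in \mathbb{R}^{m \times n}$ and $d \in \{-1, 0, 1\}^n$ exactly one of the
following two statements is true:

(1a) $\exists x \in \mathbb{R}^n \setminus \{0\}$ s.t. $(\forall i\ \ \mathrm{sign}(x_i) \in \{d_i, 0\}) \wedge Ax = 0$
(1b) $\exists y \in \mathbb{R}^m$ s.t. $(\forall i\ \ \mathrm{sign}(A^{\top} y)_i = d_i \vee d_i = 0)$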

Proof First, define a new matrix $\tilde{A}$ that is the same as
$A$, without the columns corresponding to $d_i = 0$ and
where columns corresponding to $d_i = -1$ are multiplied
by $-1$. Statement (1a) is true for $A$ if and only if (a) is true
for $\tilde{A}$. The forward direction is easily shown by removing
the zeros from $x$, where $d_i = 0$, and negating values
corresponding to $d_i = -1$ (as previously done for $A$).
Reversing this process (i.e. taking a non-negative, nonzero solution of
$\tilde{A}x = 0$, adding back the zeros and negating the same
values) shows the other direction is true as well.

Likewise, statement (1b) for $A$ is true if and only if (b) is
true for $\tilde{A}$, since columns with $d_i = -1$ are negated in $\tilde{A}$
and thus $\mathrm{sign}(\tilde{A}^{\top} y)_i = 1$ there is the same as
$\mathrm{sign}(A^{\top} y)_i = -1 = d_i$. Columns with $d_i = 0$ have no
other constraints in (1b) and the same goes for (b) since
they are removed from $\tilde{A}$.

Therefore, Corollary 1 is directly derived from Gordan's
theorem. □
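
The column manipulation used in this proof can be sketched in a few lines of Python. The helper `restrict_and_flip` builds the modified matrix (columns with d_i = 0 dropped, columns with d_i = -1 negated) and `lift` maps a solution of the modified system back to the original one by re-inserting zeros and negating the same entries. Both names and the example network are illustrative assumptions.

```python
def restrict_and_flip(A, d):
    """Drop columns with d_i = 0 and negate columns with d_i = -1."""
    kept = [j for j, d_j in enumerate(d) if d_j != 0]
    return [[d[j] * row[j] for j in kept] for row in A]


def lift(x_reduced, d):
    """Map a solution of the reduced system back to the original columns."""
    values = iter(x_reduced)
    # Re-insert zeros where d_i = 0 and undo the negation where d_i = -1.
    return [d_j * next(values) if d_j != 0 else 0 for d_j in d]


def times(A, x):
    """Compute A x for a matrix given as a list of rows."""
    return [sum(a * x_j for a, x_j in zip(row, x)) for row in A]


# Hypothetical network in which the second reaction is written as C -> B,
# so a cycle A -> B -> C -> A uses it in the reverse direction (d_2 = -1).
A = [[-1, 0, 1], [1, 1, 0], [0, -1, -1]]
d = [1, -1, 1]

A_mod = restrict_and_flip(A, d)
print(A_mod)                # [[-1, 0, 1], [1, -1, 0], [0, 1, -1]]

# x' = (1, 1, 1) is a non-negative solution of the modified system ...
print(times(A_mod, [1, 1, 1]))   # [0, 0, 0]
# ... and lifting it gives a solution of A x = 0 whose signs follow d.
x = lift([1, 1, 1], d)
print(x, times(A, x))       # [1, -1, 1] [0, 0, 0]
```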



Figure 2 Illustrative example for Corollary 2. This example shows a small network with 3 internal reactions ($x_2$ to $x_4$). The flux directions were
chosen according to the direction of the arrows. The matrix $A$ is the internal stoichiometric matrix. (a) A flux distribution is shown where all 3
internal reactions are active and form a loop. Therefore there is a solution ($x_2 = x_3 = x_4 = 1$) for the mass balance equation $Ax = 0$. In this case, no
solution exists for $A^{\top} y > 0$. Therefore this flux distribution will be eliminated by loopless-COBRA. (b) A loopless flux distribution; in this case $x_4$ is not
active. There is no solution for $Ax = 0$ (except for the trivial solution $x = 0$). Gordan's theorem claims that there must be a solution for $A^{\top} y > 0$, e.g.
the one shown in the figure. Thus, loopless-COBRA will not eliminate any such flux distributions.

Since constraint-based models usually use a vector of
real values ($v \in \mathbb{R}^n$) to represent the flux distribution,
we subsequently change the formulation of the Corollary
slightly to match.

Corollary 2 

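For all $A \in \mathbb{R}^{m \times n}$ and $v \in \mathbb{R}^n$, exactly one of the following
two statements is true:

(2a) $\exists x \in \mathbb{R}^n \setminus \{0\}$ s.t. $(\forall i\ \ \mathrm{sign}(x_i) \in \{\mathrm{sign}(v_i), 0\}) \wedge Ax = 0$
(2b) $\exists y \in \mathbb{R}^m$ s.t. $(\forall i\ \ \mathrm{sign}(A^{\top} y)_i = -\mathrm{sign}(v_i) \vee v_i = 0)$

Proof Defining $d_i = \mathrm{sign}(v_i)$, we get this directly from
Corollary 1. Note that $-\mathrm{sign}(v_i)$ can be used in (2b), since
the existence of $y$ is equivalent to the existence of $-y$. □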

This adjustment now allows us to apply Corollary 2 
to constraint-based problems and show that it eliminates 
loops (see example in Figure 2). In order to avoid any 
confusion, we point out that a solution for Ax = 0 is 
considered a loop only if A is the stoichiometric matrix 
of internal reactions. This should not be mistaken for the
steady-state mass-balance equation, which looks exactly
the same except that A then contains both internal and external
reactions.

Corollary 3 

Adding the following constraint: 

$$\exists y \in \mathbb{R}^m \ \text{s.t.}\ (\forall i\ \ \mathrm{sign}(A^{\top} y)_i = -\mathrm{sign}(v_i) \vee v_i = 0)$$

is equivalent to eliminating all loops in a flux distribution $v$.

Proof Using Corollary 2, all that must be shown is that 
statement (2a) is equivalent to having a loop. This is appar- 
ent since x is a vector in the null-space of A (i.e., a loop) 
and is consistent with the flux direction of v in each of its 
nonzero reactions. □ 

Note that the trivial case v = 0 can still be a solution and 
it should be explicitly avoided if necessary. 

Applying Corollary 3 in loopless-COBRA 

The added constraints in loopless-COBRA [17] are 
slightly different than in Corollary 3, namely: 

$$\exists G \in \mathbb{R}^n \ \text{s.t.}\ \mathrm{null}(A) \cdot G = 0 \wedge (\forall i\ \ \mathrm{sign}(G_i) = -\mathrm{sign}(v_i) \vee v_i = 0)$$

We claim here that both formulations are equivalent. 
The fundamental theorem of linear algebra states that the
nullspace, $\mathrm{null}(A)$, is the orthogonal complement of the
row space, $\mathrm{image}(A^{\top})$. Therefore, we can say that
$\mathrm{null}(A) \cdot G = 0$ if and only if $G \in \mathrm{image}(A^{\top})$, so we can rewrite the
constraint above as:

$$\exists G \in \mathrm{image}(A^{\top}) \ \text{s.t.}\ (\forall i\ \ \mathrm{sign}(G_i) = -\mathrm{sign}(v_i) \vee v_i = 0)$$

which is obviously equivalent to the constraint in Corollary 3.
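
The equivalence of the two formulations can be illustrated numerically with a short Python sketch: any G built as the transposed internal stoichiometric matrix times some y is automatically orthogonal to the null space, so only the sign conditions remain to be checked. The example network, the chosen y and the function name are illustrative assumptions.

```python
def reaction_energies(S_int, y):
    """Return G = S_int^T y, one value per reaction (column of S_int)."""
    n_cols = len(S_int[0])
    return [sum(S_int[i][j] * y[i] for i in range(len(S_int))) for j in range(n_cols)]


# Hypothetical cycle A -> B -> C -> A; null(S_int) is spanned by (1, 1, 1).
S_INT = [[-1, 0, 1], [1, -1, 0], [0, 1, -1]]
LOOP = [1, 1, 1]

# Assumed potentials that decrease along A -> B -> C.
y = [0, -1, -2]
G = reaction_energies(S_INT, y)
print(G)                                       # [-1, -1, 2]

# G lies in image(S_int^T), hence it is orthogonal to the null-space vector.
print(sum(g * x for g, x in zip(G, LOOP)))     # 0

# For v = (1, 1, 0) the active reactions have G_i < 0, as Equation 1 requires.
v = [1, 1, 0]
print(all(g < 0 for g, v_i in zip(G, v) if v_i > 0))   # True
```

Working with y instead of G removes the explicit null-space condition, which is the practical appeal of the formulation in Corollary 3.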

Conclusions 

Our results prove that the constraints proposed by
Schellenberger et al. [17] eliminate all flux solutions with loops
and nothing more. This alleviates the concern as to whether
the loopless-COBRA constraints might eliminate true flux
states. Furthermore, values in $G$ are analogous to the
change in Gibbs energy ($\Delta_r G$) of the reactions [17], and $y$
values are analogous to the chemical potentials (or formation
energies, $\Delta_f G$) of the compounds themselves. Since
in most cases there are fewer compounds than reactions
($m < n$), we believe that it is convenient and intuitive to
use the new formulation.

In conclusion, this proof provides theoretical credibil- 
ity for the loopless-COBRA constraint. However, as with 
any algorithmic MILP implementation, care must still 
be taken with respect to numerical limitations and the 
convergence of the optimization algorithm. 

Lastly, we believe this proof may be extended to similar 
methods addressing loop elimination. We also hope that 
similar proofs will appear for other methods, since more 
rigorous mathematical treatments are needed in many 
published algorithms in computational biology to prove or 
disprove their correctness. 